
Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 





© Publication number : 0 590 922 A2 



EUROPEAN PATENT APPLICATION 



@ Application number : 93307658.0 
@ Date of filing : 28.09.93 



©Int. CI. 5 : H04N 1/41 



(g) Priority : 02.10.92 US 956128 

@ Date of publication of application : 
06.04.94 Bulletin 94/14 

© Designated Contracting States : 
DE FR GB 

© Applicant : XEROX CORPORATION 
Xerox Square 

Rochester New York 14644 (US) 



(72) Inventor: Eschbach, Reiner 
812 Westwood Trail 
Webster, New York 14580 (US) 

© Representative : Hill, Cecilia Ann et al 

Rank Xerox Patent Department, Albion House, 
55-59 New Oxford Street 
London WC1A 1BS (GB) 



CM 



cr> 
m 



CL 

111 



(54) Improved decompression of standard ADCT-compressed images. 



<g) A method of decompressing a document im- 
age compressed with JPEG ADCT or the like 
method of compression includes : a) receiving 
(300) encoded quantized transform coefficient 
blocks for the original image ; b) removing (302) 
any lossless encoding of the quantized trans- 
form coefficient blocks for the original image ; 

c) multiplying (304) each quantized transform 
coefficient in a block by a corresponding quan- 
tizing value from the quantization table to ob- 
tain a block of received transform coefficients; 

d) recovering the image by applying an inverse 
transform operation (306) to the received trans- 
form coefficients ; e) reducing (308) high fre- 
quency noise appearing in the recovered image 
as a result of the lossy quantization process, 
while preserving edges, whereby the appear- 
ance of the recovered image is rendered more 
visually appealing ; f) changing (310) the filtered 
recovered image into blocks of new transform 
coefficients by the forward transform coding 
operation using the frequency space transform 
compression operation ; g) comparing (312) 
each block of new transform coefficients to a 
corresponding block of received transform 
coefficients and the selected quantization table, 
to determine (314) whether the filtered recov- 
ered image is derivable from the original im- 
age ; and h) upon the determination 
transferring (316,320) the filtered recovered im- 
age to an output buffer. 
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The present invention is directed generally to the 
production of document images and, in particular, to 
a method of improving the appearance of a decom- 
pressed document image while maintaining fidelity 
with an original document image from which it is de- 
rived. 

Data compression is required in data handling 
processes, where too much data is present for prac- 
tical applications using the data. Commonly, com- 
pression is used in communication links, where the 
time to transmit is long, or where bandwidth is limited. 
Another use for compression is in data storage, where 
the amount of media space on which the data is stor- 
ed can be substantially reduced with compression. 
Yet another application for compression is in a digital 
copier where an intermediate storage of data is re- 
quired for collation, reprint or any other digital copier 
function. Generally speaking, scanned images, i.e., 
electronic representations of hard copy documents, 
are commonly large, and thus are desirable candi- 
dates for compression. 

Many different compression techniques exist, 
and many are proprietary to individual users. How- 
ever, standards are desirable whenever intercommu- 
nication between devices will be practiced. Particu- 
larly with the advent of multimedia communication, 
where formerly dissimilar devices are required to 
communicate, a common standard will be required. 
An example is the current desirability of FAX ma- 
chines to be able to communicate with printers. Cur- 
rently, compression standards are generally distinct 
for different devices. 

Three major schemes for image compression are 
currently being studied by international standardiza- 
tion groups. A first, for facsimile type image transmis- 
sion, which is primarily binary, is under study by the 
JBIG (Joint Binary Image Group) committee, a sec- 
ond for TV and film, a standard is worked on by the 
MPEG (Motion Pictures Expert Group). For non-mov- 
ing general images, i.e., still images which are more 
general than the ones covered by JBIG, the group 
JPEG (Joint Photographic Expert Group) is seeking 
to develop a device independent compression stan- 
dard, using an adaptive discrete cosine transform 
scheme. 

ADCT (Adaptive Discrete Cosine Transform, de- 
scribed for example, by W. H. Chen and C. H. Smith, 
in "Adaptive Coding of Monochrome and Color Im- 
ages", IEEE Trans. Comm., Vol. COM-25, pp. 1285- 
1292, November 1977), as the method disseminated 
by the JPEG committee will be called in this applica- 
tion, is a lossy system which reduces data redundan- 
cies based on pixel to pixel correlations. Generally, in 
images, on a pixel to pixel basis, an image does not 
change very much. An image therefore has what is 
known as "natural spatial correlation". In natural 
scenes, correlation is generalized, but not exact. 
Noise makes each pixel somewhat different from its 



neighbors. 

Generally, as shown in Figure 1, the process of 
compression requires a tile memory 10 storing an M 
x M tile of the image. We will use square tiles in the 

5 description based on the JPEG recommendations, 
but it has to be noted that methods in accordance with 
the invention, described below, can be performed 
with any form of tiling. From the portion of the image 
stored in tile memory, the discrete cosine transform 

10 (DCT), a frequency space representation of the im- 
age is formed at transformer 12. Hardware implemen- 
tations are available, such as the C-Cube Microsys- 
tems CL550A JPEG image compression processor, 
which operates in either the compression or the de- 

15 compression mode according to the proposed JPEG 
standard. A divisor/quantization device 14 is used, 
from a set of values referred to as a Q-Table, stored 
in a Q table memory 16, so that a distinct Q table val- 
ue is divided into the DCT value, returning the integer 

20 portion of the value as the quantized DCT value. A 
Huffman encoder 20, using statistical encoding of the 
quantized DCT values, generates the compressed 
image that is output for storage, transmission, etc. 
The current ADCT compression method divides 

25 an image into M x M pixel blocks, where M = 8. The 
selection of M = 8 is a compromise, where the larger 
the block given, the higher the compression ratio ob- 
tainable. However, such a larger block is also more 
likely to have non-correlated pixels within the block, 

30 thereby reducing the compression ratio. If the block 
was smaller, greater correlation within the block 
might be achieved, but less overall compression 
would be achieved. Particularly within a document 
image, edges of the image are more likely to be en- 

35 countered within an 8 x 8 block, than would be the 
case for a scene forming a natural image. Thus, the 
assumption of spatial correlation fails to some extent. 
A major problem addressed by the present invention, 
as will become more apparent hereinbelow, is that the 

40 assumptions of the ADCT proposal work well for pho- 
tographs containing continuous tones and many lev- 
els of gray pixels, but often work poorly for the repro- 
duction of document images, which have significant 
high frequency components and many high contrast 

45 edges. 

Compression schemes tend to use a set of basis 
functions to utilize the intra block correlations. Basis 
functions define the data as a projection onto a set of 
orthogonal functions on an interval. ADCT uses co- 

so sine functions as the basis functions and the Discrete 
Cosine Transform (DCT) as the projection step. In the 
first step of the ADCT standard, the image is tiled into 
8x8 blocks. Within each block, a set of 64 DCT coef- 
ficients is determined for the pixels in the block. The 

55 DCT coefficients represent the coefficients of each 
cosine term of the discrete cosine transform of the 8 
x 8 block. 

Referring now to Figure 2A, an array of 64 gray 
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level values representing 64 pixels in an 8 x 8 block 
of the image is shown. This 8x8 block is transformed 
according to the JPEG ADCT specifications giving 
the DCT coefficients shown in Figure 2B. These coef- 
ficients still completely describe the image data of 
Figure 2A, but in general larger values will now cluster 
at the top left corner in the low spatial frequency re- 
gion. Simultaneously, in the vast majority of images 
as the frequency of the image increases, the coeffi- 
cient values in the lower right hand portion of the grid 
tend towards zero. 

Generally, the human eye tends to see low fre- 
quencies in an image best. At higher frequencies, 
changes from amplitude to amplitude are unnotice- 
able, unless such changes occur at extremely high 
contrast. This is a well known effect of the Human Vis- 
ual System and extensively documented, see e.g. 
"Visual Performance and Image Coding" by P. Roe- 
tling, Proceedings of the S.I.D. 17/2 pp. 111 - 114 
(1976). The ADCT method makes use of the fact that 
small amplitude changes at high frequencies can be 
generally ignored. 

The next step in the ADCT method involves the 
use of a quantization or Q- matrix. The Q- matrix 
shown in Figure 2C is a standard JPEG-suggested 
matrix for compression, but the ADCT method and 
methods in accordance with the present invention, 
described below, can also operate using other Q-ma- 
trices (or Q-Tables). The matrix incorporates the ef- 
fect that lower frequencies are roughly more impor- 
tant than high frequencies by introducing generally 
larger quantization steps, i.e. larger entries, for larger 
frequencies. However, the table also attempts to in- 
ternally construct some desirable variations from the 
general assumption. Accordingly, the values in the ta- 
ble do vary with frequency, where the exact variation 
might be a function of the human visual system, of the 
document type expected, i.e.: photo, text, graphic, 
etc., or of some other application dependent parame- 
ter. Each of the DCT values from Figure 2B is divided 
by a corresponding Q-matrix value from Figure 2C 
giving quantized DCT (QDCT) values by way of: 
QDCT[m][n] = INT{DCT[m][n] -r Q - Table[m][n] 
+ V5>} 

where INT{A} denotes the integer part of A 
The remainder from the division process is dis- 
carded, resulting in a loss of data. Here and in the fol- 
lowing we use the term division to describe the proc- 
ess detailed in ADCT including the methods for han- 
dling truncation and round-off. Furthermore, since 
the Q values in the lower right hand portion of the ta- 
ble tend to be high, most of the values in that area go 
to zero, unless there were extremely high amplitudes 
of the image at the higher frequencies. 

After deriving the quantized set of DCT values, 
shown in Figure 2D, pixels are arranged in the order 
of a space filling zigzag curve and a statistical encod- 
ing method, such as the Huffman process, is used to 



generate the transmitted signal. This statistical cod- 
ing is performed in a lossless way and the only loss 
introduced in the compression is the one generated 
by the quantization of the DCT coefficients using the 

5 Q-Table. 

ADCT transforms are well known, and hardware 
exists to perform the transform on image data, e.g., 
US-A 5,049,991 to Nihara, US-A 5,001,559 to Gon- 
zales et al., and US-A4,999, 705 to Puri. The primary 

10 thrust of these particular patents, however, is moving 
picture images, and not document images. 

To decompress the now-compressed image, and 
with reference to Figure 1, a series of functions or 
steps are followed to reverse of the process descri- 

15 bed. The Huffman encoding is removed at decoder 
50. The image signal now represents the quantized 
DCT coefficients, which are multiplied at signal mul- 
tiplier 52 by the Q table values in memory 54 in a proc- 
ess inverse to the compression process. At inverse 

20 transformer 56, the inverse transform of the discrete 
cosine transform is derived, and the output image in 
the spatial domain is stored at image buffer 58. 

In the described decompression method, Huff- 
man encoding is removed to obtain the quantized 

25 DCT coefficient set. Each member of the set is mul- 
tiplied by a Q-Table value resulting in the DCT coef- 
ficients shown in Figure 3Aby using the data of Figure 
2C and 2D by ways of: 

DCT[m][n] = QDCT[m][n] x Q - Table[m][n]. 

30 However, the result shown in Figure 3A is not the 

original set of DCT coefficients shown in Figure 2B, 
because the remainders calculated for the original 
quantization of the DCT coefficients with the Q-Table 
in the compression process have been lost. In a stan- 

35 dard ADCT decompression process, the inverse dis- 
crete cosine transform of the set of DCT coefficients 
is derived to obtain image values shown in Figure 3B. 

The described process does not work to repro- 
duce the best images. Clearly, it cannot reproduce 

40 the original image, since data within the image was 
discarded in the compression-quantization step. Fail- 
ures are noted wherever strong edges, commonly 
present in text, appear. Particularly, at such edges 
"ringing artifacts" or in some references, "mosquito 

45 noise" is noted. These problems occur in text, graph- 
ics and halftones, components very common in docu- 
ment images. In addition to mosquito noise or ringing 
artifacts, a blocking artifact often appears associated 
with image areas with slowing varying grays, where 

so each MxM block which formed the calculation of the 
compression basis appears visible. In either case, a 
problem has occurred. 

In order to remove the artifacts noted, two meth- 
ods of attacking the problem have been attempted. In 

55 a first method, the decompressed image is post proc- 
essed, i.e., after the image has been fully decom- 
pressed, an attempt is made to improve the image. Of 
course, such processing can never go back to the 
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original image, because that image has been tost. 
Such processes are demonstrated in the article "Re- 
duction of Blocking Effects in Image Coding" by Re- 
eve, III et al. t Optical Engineering, January/February, 
1 984, Vol. 23, No. 1 , p. 34, and "Linear Filtering for Re- 5 
ducing Blocking Effects in Orthogonal Transform Im- 
age Coding", by C. Avril et al., Journal of Electronic 
Imaging, April 1992, Vol. 1(2), pp. 183-191. However, 
this post- processing of the image leads to a recon- 
struction that could not have been the real source im- 10 
age and subsequent compression/decompression 
steps as they are possible in electronic imaging ap- 
plications will lead to potentially larger and larger de- 
viations between reconstruction and original. 

Another approach to the problem is through an 15 
iterative decoding process using the known band Mm it 
of the data. In this method, using the compressed 
form of the image again, different blocks, perhaps 32 
x 32, are used to decode the image. In one example, 
"Iterative Procedures for Reduction of Blocking Ef- 20 
fects in Transform Image Coding", by R. Rosenholtz 
etal., SPIE, Vol. 1452, Image Processing Algorithms 
and Techniques II, (1991), pp. 116-126, a method of 
blurring the overall image was considered, with the 
hope that such blurring would tend to smooth out the 25 
block artifacts noted above. 

In accordance with the invention, there is provid- 
ed a method of decompressing a compressed docu- 
ment image, wherein the decompressed image is fil- 
tered, and the filtered image is compared to the set 30 
of images derivable from the original image, before 
compression, to reduce image noise and assure that 
the decompressed image is within a known range of 
images. 

More particularly, the present invention provides 35 
a method of improving the appearance of a decom- 
pressed document image while maintaining fidelity 
with an original document image from which it is de- 
rived, wherein for compression, an original document 
image is divided into blocks of pixels, said blocks of 40 
pixels are changed into blocks of transform coeffi- 
cients by a forward transform coding operation using 
a frequency space transform operation, said trans- 
form coefficients subsequently quantized with a los- 
sy quantization process in which each transform coef- 45 
ficient is quantized according to a quantizing value 
from a quantization table and the result is used as a 
quantized transform coefficient, the method includ- 
ing the decompression steps of: 

a) receiving said quantized transform coefficient so 
blocks for said original image; 

b) dequantizing the transform coefficients in a 
block according to a corresponding quantizing 
value from the quantization table to obtain a block 

of received transform coefficients; 55 

c) recovering the image by applying an inverse 
transform operation to the received transform 
coefficients; 



d) with a selected non-linear filter, reducing high 
frequency noise appearing in the recovered im- 
age as a result of the lossy quantization process, 
while preserving edges, whereby the appear- 
ance of the recovered image is rendered more 
visually appealing; 

e) changing the filtered recovered image into 
blocks of new transform coefficients by the for- 
ward transform coding operation using the fre- 
quency space transform compression operation; 

f) comparing each block of new transform coeffi- 
cients to a corresponding block of received trans- 
form coefficients and the selected quantization 
table, to determine whether the filtered recov- 
ered image is derivable from the original image; 
and 

g) upon said determination transferring the fil- 
tered recovered image to an output buffer. 

In accordance with one aspect of the invention, 
there is provided a method of improving the appear- 
ance of a decompressed document image while main- 
taining fidelity with an original document image from 
which it is derived, wherein for compression, an orig- 
inal document image is divided into blocks of pixels, 
the blocks of pixels are changed into blocks of trans- 
form coefficients by a forward transform coding op- 
eration using a frequency space transform compres- 
sion operation, the transform coefficients subse- 
quently quantized with a lossy quantization process in 
which each transform coefficient is divided by a quan- 
tizing value from a quantization table and the integer 
portion of a result is used as a quantized transform 
coefficient, and the blocks of quantized transform 
coefficients are encoded with a lossless encoding 
method, the method including the decompression 
steps of: a) receiving the encoded quantized trans- 
form coefficient blocks for the original image; b) re- 
moving any lossless encoding of the quantized trans- 
form coefficient blocks for the original image; c) mul- 
tiplying each quantized transform coefficient in a 
block by a corresponding quantizing value from the 
quantization table to obtain a block of received trans- 
form coefficients; d) recovering the image by apply- 
ing an inverse transform operation to the received 
transform coefficients; e) with a selected filter, reduc- 
ing high frequency noise appearing in the recovered 
image as a result of the lossy quantization process, 
while preserving edges, whereby the appearance of 
the recovered image is rendered more visually ap- 
pealing; f) changing the filtered recovered image into 
blocks of new transform coefficients by the forward 
transform coding operation using the frequency 
space transform operation; g) comparing each block 
of new transform coefficients to a corresponding 
block of received transform coefficients and the se- 
lected quantization table, to determine whether the 
filtered recovered image is derivable from the original 
image; and h) upon the determination transferring 
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the f Btered recovered image to an output buffer. 

In accordance with another aspect of the inven- 
tion, in such a decompression process, step g) may 
include the additional steps of: 1) determining that 
the block of new transform coefficients is not deriva- 
ble from the original image; and 2) altering individual 
new transform coefficients, so that a block of altered 
new transform coefficients is derivable from the orig- 
inal image, 3) recovering the image from the blocks 
of altered new transform coefficients. 

In accordance with still another aspect of the in- 
vention, the steps e-g may be repeated iteratively un- 
til a selected condition occurs. Possible selected con- 
ditions are i) a number of iterations, ii) no corrections 
are required to the transformed image, or iii) no 
changes have been made in successive iterations to 
the inverse transformed image. 

By altering the decompression process, as op- 
posed to altering the compression process, integrity 
and compatibility with the JPEG standard process 
are maintained independent on the number of com- 
pression/decompression cycles the document image 
data undergo. Additionally, the process can be selec- 
tively used, based on the image input, and on whether 
improvements can be made using the decompression 
process. 

By way of example only, embodiments of the in- 
vention will be described with reference to the accom- 
panying drawings, in which: 

Figure 1 (already described) shows a functional 
block diagram for the prior art ADCT compres- 
sion/recompression process; 
Figure 2A (already described) shows an 8 x 8 
block of image data to be compressed; Figure 2B 
(already described) shows the discrete cosine 
values as determined, giving a frequency space 
representation of the image of Figure 2A; Figure 
2C (already described) shows the default Q-Ta- 
ble used in the examples; and Figure 2D (already 
described) shows the quantized discrete cosine 
values as determined; 

Figure 3 A (already described) shows the DCT 
values regenerated from the data of Figure 2A by 
use of the Q-Table of Figure 2C, and Figure 3B 
(already described) shows the corresponding 8 x 
8 reconstructed image data block. 
Figure 4 shows the principle of a single set of 
quantized ADCT values representing a class of 
distinct and possible images; 
Figures 5A-C show numerical examples for the 
case of a multiple source image being represent- 
ed by a single set of quantized ADCT value; 
Figure 6 shows the effect of the edge- preserving 
lowpass filter used in the examples; 
Figure 7A shows one exemplar image block along 
with its neighborhood and the filter context; Fig- 
ure 7B shows the results of the filtering process 
on the image; Figure 7C shows the DCT trans- 



form of the image, the circles illustrating that the 
image is not within the set of possible images; 
Figure 7D shows an alteration to the DCT trans- 
form of the image; and Figure 7E shows the in- 
5 verse transform of the altered DCT transform; 

Figure 8 shows a graphical representation of Fig- 
ure 7, where the filtered image lies outside the 
set of images with identical ADCT representa- 
tions; and 

10 Figure 9 shows a flow chart of an iterative decom- 

pression process. 

Referring now to the drawings, we note initially, 
that while it is impossible to return to the exact image 
which was compressed originally in the compression 

15 process, because data has been lost in the compres- 
sion process, it is possible to return to an image which 
is similar in some respect to the original compressed 
image, as will be further described hereinafter. Sec- 
ondly, it is possible to correct the basic image defects 

20 that are appearing in the image. With reference now 
to Figure 4, a general overview of the compres- 
sion/decompression process is shown. There exists a 
set of images which are distinct from each other, but 
which are similar in the respect that each image in the 

25 set compresses to the same ADCT representation. 
Therefore, any decompression process should pro- 
duce an output image which is within this set. The 
knowledge of the set of possible images is coded by 
the Q-Table used. Since the Q-Table represents div- 

30 isors of the discrete quantized transform coefficients, 
and as a result of the quantization process fractional 
portions of each coefficient are discarded, then the 
set of possible images represents all those images 
from which the same quantized transform coeffi- 

35 cients can be determined about a range of possible 
coefficient values for each term of the transform. 

With reference now to Figure 5A, there is shown 
a set of possible source images 100, 102, 104 and 
106, each consisting of image signals having a gray 

40 density value, ranging between 0 and 256 for the ex- 
ample, followed by their corresponding DCT coeffi- 
cients as a result of the DCT conversion (for this illus- 
tration, illustrated as the DCT 108. These images rep- 
resent portions of document images, generated by 

45 scanning an original document with an input scanner 
or created an electronic document on a computer, etc. 
As can be seen, the images are distinct and the DCT 
coefficients 110, 112, 114 and 11 6 shown in Figure 5B 
are distinct. The DCT coefficients are quantized at 

so quantization 118 using the corresponding entries in 
the Q-Table 119 shown in Figure 5C. In this example 
the top left entry of the DCT coefficients is divided by 
the top left entry [16] in the Q-Table. Using a rounding 
operation for the fractional part, the result for those 

55 coefficients are 

in set 110, 157/16 = 9.81 rounds to 10, 

in set 112, 160/16 = 10 

in set 114,163/16 = 10.19 rounds to 10, and 
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in set 116, 162/16 = 10.13 rounds to 10. 
All of the top left entries in the table of DCT coeffi- 
cients are therefore mappable to the same quantized 
DCT coefficients (set 120 shown in Figure 5C) using 
this Q-Table. The same is true for all other DCT coef- 5 
f icients shown in Figure 5A. The compressed data of 
set 120, therefore describes a set of possible source 
images rather than a unique source image with a sub- 
set of those possible source images shown in Figure 
5A. The determination that an 8 x 8 image block is a 10 
possible source of the quantized DCT coefficients 
can be derived by considering the fact that the Q-Ta- 
ble entries define the quantizers and therefore the 
accuracy of the DCT coefficients. In the example giv- 
en in Figure 5B, the top left entry is bounded by 153 15 
^ entry ^ 168, spanning 16 values, any value in that 
range can be used as DCT coefficient without altering 
the compressed data. It is this "non-uniqueness" that 
is utilized in the method as described below, by se- 
lecting a possible source image that is i) a possible 20 
source image conforming with the compressed data 
and ii) an image that conforms to the model of a docu- 
ment image. In this way the method differs from pre- 
vious methods in which the ultimate image derived, 
does not conform to the compressed data as in post 25 
filtering methods, or does not use a source image 
model to restrict the decompression, but rather used 
a sampling consideration to blur the image. 

In order to define an expectation for the input 
document image, it is noted that the image problem 30 
that results from the decompression process is ring- 
ing, or high frequency noise. Elimination of the high 
frequency noise is desirable, requiring a low pass fil- 
ter. Unfortunately, the ringing occurs at edges, which 
are very common and important in document images. 35 
Accordingly, since edges represent high frequencies, 
which should not be removed, a simple low pass filter 
is not adequate, because it would destroy the edge in- 
formation, i.e.: the readability of characters or the 
definition of lines, along with the reduction of noise. 40 
Rather, a low pass filter that preserves edges is re- 
quired. Such a filter is a non-linear filter which might 
be an order statistical filter (or median filter), a sigma 
filter or the like. 

In our present example and with reference to Fig- 45 
ure 6, the sigma filter operates on the image using a 
noise value or delta of A = 32 in this example. The pix- 
el with value 231 is surrounded by pixels that do not 
deviate from this value by more than A, and therefore, 
an output value 218 is derived by averaging over all so 
9 pixels in a 3 x 3 area. The pixel with value 40 is sur- 
rounded by pixels within the range as well as by pixels 
exceeding the range, resulting in an output value 55 
generated by averaging over 3 pixels. 

With reference now to Figure 7A, a set of decom- 55 
pressed pixels and their gray values, in an 8 x 8 pixel 
block within a 10 x 10 pixel neighborhood is indicated 
by the black lines segregating the neighborhood from 
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the remainder of the image, is shown. At borders of 
the image, we simply replicate pixels, and in Figure 
7Athis is represented in the leftmost column and the 
bottom row. The sigma filter is represented as an 
averaging function performed over pixels in a 3 x 3 
neighborhood that do not deviate from the center pix- 
el by more than a value of 32. In Figure 7A, values 
155, 96, and 141 (all circled), deviate from the center 
pixel 206 (identified by a square) by more than a value 
of 32 (the noise value or A). Applying this filter, a 
modified image is obtained from the gray level pixels, 
given in Figure 7B and resulting in a new value of 208 
(identified by a square) for the pixel. 

Having obtained a filtered image, which has atte- 
nuated the high frequency noise within the decom- 
pressed image, a comparison to the original image is 
obtained to assure fidelity with the original image. 
The DCT of the image is derived, shown in Figure 7C. 
The results are directly compared with the original 
compressed image (compared, for example to DCT 
coefficient sets 110, 112, 114, and 116, and Q-Table 
119, of Figures 5B and 5C). Each position in the DCT 
set has a range of acceptable values, and in the ex- 
ample of Figure 7C, the value at position 3,1 (value 
of 74 circled) is beyond the range of acceptable val- 
ues earlier determined to be 75 to 84 by means of 8 
= INT{acceptable value + 10 + V2}, where '8' is the 
quantized DCT coefficient and '10' is the corre- 
sponding Q-Table entry. Accordingly, value 75 is sub- 
stituted for value 74. In other words, the transform 
values have been altered so that the image which 
was outside the set of possible images, is now just 
within that set (see Figures 5Aand 5B). While a value 
just inside the acceptable range is selected in this ex- 
ample, experience may show that values at other 
positions in the range might also be suitable substi- 
tutes for out-of-range values. Figure 7D shows the 
corrected DCT coefficient at position 3,1 with an ad- 
ditional modified entry at position 1 , 4 (circled). From 
Figures 7C and 7D it is clear, that a simple filtering of 
a decompressed image does not guarantee that the 
result of the filtering operation is an image that was 
a possible source image for the compressed data, 
thus generating an image that is in violation of the 
DCT data. The inverse transform of the altered DCT 
set is derived, and a set of gray image pixels is again 
obtained as shown in Figure 7E. All pixels in Figure 7E 
that have been altered in reference to Figure 7B are 
again circled. It should be noted that the values given 
in Figure 7E are the values of a possible original im- 
age, whereas two values given in Figure 7B were in 
violation of acceptable DCT ranges. 

The process of filtering the image, transforming 
to frequency space, comparing and altering the fre- 
quency space values, and retransforming back to 
gray image pixels may be reiterated a number of 
times. The number of iterations can be selected on a 
basis of no changes in the DCT transform (i.e., there 
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is convergence). Alternatively, the process may be it- 
erated until there are no further changes in the image 
(another type of convergence). In yet another alter- 
native, the number of iterations set to a fixed number, 
or a fixed number can serve as an upper limit of iter- 
ations. 

In the iterative process used, the noise estimate 
used in the filter may be changed for each iteration. 
That is, instead of assuming A = 32 and using that val- 
ue for each iteration, values of two thirds of A in the 
second iteration, and one third A in third iteration for 
a three iteration case, might be used. We also might 
filter and sharpen (applying a different filter) in se- 
quential operations. 

Now the image is a new one. First, it is noted that 
this image is a possible original image, which could 
have resulted in the compressed original image. 
Therefore, there is some fidelity to the original image. 
The image has been smoothed at least once to re- 
move high frequency noise while enhancing or main- 
taining edges. The correction may have introduced 
changes in the image, but a compressed version of 
the filtered image has been compared to the original 
compressed image (the original possible set) to as- 
sure fidelity. The filtered image was corrected as re- 
quired. It is assumed that the corrected filtered image 
is better than the filtered image because it is in com- 
plete agreement with the range of possible images. 

Figure 8 illustrates the principal of the method. 
The original image is compressed; the compressed 
representation is decompressed. The decompressed 
image is filtered to improve appearance, but in doing 
so, it is forced outside the range of images that are ac- 
ceptable. The DCT representation of the image is 
therefore altered, in order to force the image into the 
acceptable range of images. 

With reference now to Figure 9, a flow chart of 
the iterative ADCT decompression/reconstruction is 
provided showing the additional operations of the 
method in accordance with the present invention. An 
image compressed in accordance with the ADCT 
compression method with statistical encoding is ob- 
tained at step 300. The statistical encoding is re- 
moved at step 302 to obtain quantized DCT coeffi- 
cients. At step 304, the quantized DCT coefficients 
are multiplied by values in the Q table to obtain the 
set of DCT coefficients. At step 306, the inverse trans- 
form of the DCT coefficients is derived to produce the 
gray level image. Deviating from the normal process, 
at step 308 the 8x8 output block is filtered. At step 
310, the filtered image output is used to generate a 
set of DCT coefficients. At step 312, the filtered im- 
age DCT coefficients are compared to the DCT coef- 
ficients obtained at step 312, and an acceptable 
range about each value. At step 314, if the filtered im- 
age DCT coefficients are within the acceptable range 
about each value, then at step 316, the inverse trans- 
form of the DCT coefficients is derived to produce the 
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gray level image. At step 320, the gray image is direct- 
ed to an output. If the filtered image DCT coefficients 
are not within the acceptable range about each value, 
then at step 322, acceptable values are substituted 

s for out-of-range values, and the process is repeated 
from step 306. Not shown, a counter may be incre- 
mented to limit the number of iterations. If, at step 324 
the limit of the iteration number is reached, the data 
is transferred to block 316 for subsequent output 

10 through block 320. This step guarantees that the out- 
put gray level image is a valid image for the com- 
pressed description received at block 300. 

It should be noted that one specific filter type and 
one specific conflict resolution for the DCT coeffi- 

15 cientswas used in the examples. Clearly, different fil- 
ters are possible, including general order statistic fil- 
ters and f ind-and-replace" filters. Also, the filter can 
be based on the local characteristics of the 8x8 
block. An example is an edge preserving smoothing 

20 filter that uses a fraction of the dynamic range of the 
local 8x8 blocks, and/or the Q-matrix to determine 
the maximum noise variation encountered in the 
block. Another option for the manipulation in the im- 
age domain, in contrast to the DCT domain, is the 

25 thresholding of data if a binary original is assumed. 
This assumption, however, has to be tested in a DCT 
domain and the thresholding should only be accepted 
if the DCT coefficients after thresholding are in 
agreement with the DCT coefficients before thresh- 

30 olding. 

Different options exist for the conflict resolution 
of the DCT coefficients. Convergence speed can be 
improved by modifying the algorithm slightly. Possi- 
ble modifications include, e.g., the addition of noise 

35 into the modified DCT coefficients. Finally, the num- 
ber of iterations can be a function of the local 8x8 
block characteristics rather than using a fixed num- 
ber as used in the examples. 

40 

Claims 

1. A method of improving the appearance of a de- 
compressed document image while maintaining 

45 fidelity with an original document image from 

which it is derived, wherein for compression, an 
original document image is divided into blocks 
(Fig. 2A) of pixels, said blocks of pixels are 
changed into blocks (Fig. 2B) of transform coef- 

so f icients by a forward transform coding operation 

using a frequency space transform operation, 
said transform coefficients are subsequently 
quantized with a lossy quantization process in 
which each transform coefficient is quantized ac- 

55 cording to a quantizing value from a quantization 

table (Fig. 2C) and the result (Fig. 2D) is used as 
a quantized transform coefficient, the method in- 
cluding the decompression steps of: 
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a) receiving (300) said quantized transform 
coefficient blocks for said original image; 

b) dequantizing (304) the transform coeffi- 
cients in a block according to the quantization 
table (11 9) to obtain a block of received trans- 5 
form coefficients; 

c) recovering the image by applying an in- 
verse transform operation (306) to the re- 
ceived transform coefficients; 

d) with a selected filter, reducing (308) high 10 
frequency noise appearing in the recovered 
image as a result of the lossy quantization 
process, while preserving edges, whereby 

the appearance of the recovered image is 
rendered more visually appealing; 15 

e) changing the filtered recovered image into 
blocks of new transform coefficients by the 
forward transform coding operation (310) us- 
ing the frequency space transform compres- 
sion operation; 20 

f) comparing (312) each block of new trans- 
form coefficients (Fig. 7C) to a corresponding 
block of received transform coefficients and 
the selected quantization table (Figs. 5B and 

5C), to determine (314) whether the filtered 25 
recovered image is derivable from the original 
image; and 

g) upon said determination transferring 
(316,320) the filtered recovered image to an 
output buffer. 30 



A method as claimed in any one of the preceding 
claims, wherein for compression, a quantized 
transform coefficient is produced by dividing (14) 
each transform coefficient by a quantizing value 
from the quantization table, the integer portion of 
the result being used as the quantized transform 
coefficient; and wherein the step of dequantizing 
the quantized transform coefficients comprises 
multiplying (52) each quantized transform coeffi- 
cient by a corresponding quantizing value from 
the quantization table. 



A method as claimed in claim 1, wherein step f) 
includes the additional steps of 

1) determining that the block of new trans- 
form coefficients is not derivable from the 35 
original image; and 

2) altering (322) individual new transform 
coefficients, so that a block of altered new 
transform coefficients (Fig. 7D) is derivable 
from the original image; 40 

3) recovering the image (Fig. 7E) from the 
blocks of altered new transform coefficients. 



3. A met hod as claimed in claim 2, including repeat- 
ing steps d-f iteratively until a selected condition 
occurs. 



45 



A method as claimed in any one of the preceding 
claims, wherein the forward transform coding op- 
eration using the frequency space transform op- 
eration is a discrete cosine transform. 



50 



5. A method as claimed in any one of the preceding 
claims, wherein, for compression, the quantized 
transform coefficients are further encoded (20) 
with a lossless encoding method, and the addi- 
tional step (302) of removing any lossless encod- 
ing is provided prior to step b). 



55 
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DOT COEFFICIENTS (MULTIPLY FIG. 2C VALUES BY FIG. 2D) 
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